Audio-visual classification of Swedish phonemes for pronunciation training

Authors

  • Hedvig Kjellström
  • Olov Engwall
  • Sherif Abdou
  • Olle Bälter
Abstract

We present a method for audio-visual classification of Swedish phonemes, to be used in computer-assisted pronunciation training. The probabilistic kernel-based method is applied to the audio signal and/or to either a principal component analysis (PCA) or an independent component analysis (ICA) representation of the mouth region in video images. We investigate which representation (PCA or ICA) is most suitable, and how many components the basis requires, for automatically detecting pronunciation errors in Swedish from audio-visual input. Experiments performed on one speaker show that the visual information helps avoid classification errors that would lead to gravely erroneous feedback to the user; that it is better to perform phoneme classification on audio and video separately and then fuse the results, rather than combining them before classification; and that PCA outperforms ICA when few components are used.
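
The "classify separately, then fuse" strategy described in the abstract can be illustrated with a minimal sketch. The abstract does not state which fusion rule the authors used, so the weighted log-linear (product) rule below is an assumption, as are the function name `late_fusion`, the `audio_weight` parameter, and the made-up posterior values.

```python
import math

def late_fusion(audio_post, video_post, audio_weight=0.5):
    """Fuse two posterior distributions over the same phoneme labels.

    Assumption: a weighted log-linear (product) rule. The abstract does not
    specify the fusion rule, so this is only one plausible choice.
    """
    if audio_post.keys() != video_post.keys():
        raise ValueError("both classifiers must score the same phoneme set")
    eps = 1e-12  # avoids log(0) when a classifier assigns zero probability
    log_scores = {
        ph: audio_weight * math.log(audio_post[ph] + eps)
        + (1.0 - audio_weight) * math.log(video_post[ph] + eps)
        for ph in audio_post
    }
    # Normalise stably by subtracting the largest log score first.
    top = max(log_scores.values())
    unnorm = {ph: math.exp(s - top) for ph, s in log_scores.items()}
    total = sum(unnorm.values())
    return {ph: v / total for ph, v in unnorm.items()}

# Hypothetical per-classifier posteriors for one segment (invented numbers).
# The audio classifier narrowly prefers "n"; the video classifier sees
# closed lips and strongly prefers "m".
audio = {"m": 0.45, "n": 0.50, "l": 0.05}
video = {"m": 0.80, "n": 0.10, "l": 0.10}

fused = late_fusion(audio, video)
best = max(fused, key=fused.get)
print(best, round(fused[best], 3))
```

In this toy case the audio classifier alone would pick "n", while the fused decision is "m", which mirrors the abstract's claim that visual information can prevent some classification errors. Early fusion would instead concatenate the audio and video features into one vector before a single classifier is applied.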

Similar resources

Audio-visual phoneme classification for pronunciation training applications

We present a method for audio-visual classification of Swedish phonemes, to be used in computer-assisted pronunciation training. The probabilistic kernel-based method is applied to the audio signal and/or either a principal or an independent component (PCA or ICA) representation of the mouth region in video images. We investigate which representation (PCA or ICA) may be most suitable and t...

On the Efficacy of a Communicative Framework in Teaching English Phonological Features Absent in Persian to Iranian EFL Learners

Although Persian and English share many common phonemes, there are some phonological features that are present in English but absent in Persian which tend to lead to mispronunciation on the part of Persian learners of English, mostly through negative transfer. The present research assesses the efficacy of a communicative framework in improving Iranian adult EFL learners’ pronunciation of five E...

Can audio-visual instructions help learners improve their articulation? - an ultrasound study of short term changes

This paper describes how seven French subjects change their pronunciation and articulation when practising Swedish words with a computer-animated virtual teacher. The teacher gives feedback on the user’s pronunciation with audiovisual instructions suggesting how the articulation should be changed. A wizard-of-Oz set-up was used for the training session, in which a human listener chose the adeq...

A System Demonstration of a Framework for Computer Assisted Pronunciation Training

In this paper, we demonstrate a system implementation of a framework for computer-assisted pronunciation training for second language (L2) learners. This framework supports an iterative improvement of the automatic pronunciation error recognition and classification by allowing integration of annotated error data. The annotated error data is acquired via an annotation tool for linguists. This pap...

Detecting confusable phoneme pairs for Swedish language learners depending on their first language

This paper proposes a paradigm where commonly made segmental pronunciation errors are modeled as pair-wise confusions between two or more phonemes in the language that is being learnt. The method uses an ensemble of support vector machine classifiers with time varying Mel frequency cepstral features to distinguish between several pairs of phonemes. These classifiers are then applied to classify...

Publication date: 2007